Abstract

Experiments in recent years have vividly demonstrated that gene expression can be highly 
stochastic. How protein concentration fluctuations affect the growth rate of a population of cells
is, however, a wide open question. We present a mathematical model that makes it possible to 
quantify the effect of protein concentration fluctuations on the growth rate of a population of ge- 
netically identical cells. The model predicts that the population's growth rate depends on how 
the growth rate of a single cell varies with protein concentration, the variance of the protein con- 
centration fluctuations, and the correlation time of these fluctuations. The model also predicts 
that when the average concentration of a protein is close to the value that maximizes the growth 
rate, fluctuations in its concentration always reduce the growth rate. However, when the average 
protein concentration deviates sufficiently from the optimal level, fluctuations can enhance the 
growth rate of the population, even when the growth rate of a cell depends linearly on the protein 
concentration. The model also shows that the ensemble or population average of a quantity, such 
as the average protein expression level or its variance, is in general not equal to its time average 
as obtained from tracing a single cell and its descendants. We apply our model to perform a 
cost-benefit analysis of gene regulatory control. Our analysis predicts that the optimal expression 
level of a gene regulatory protein is determined by the trade-off between the cost of synthesizing 
the regulatory protein and the benefit of minimizing the fluctuations in the expression of its target
gene. We discuss possible experiments that could test our predictions. 
Author summary

Living cells use regulatory networks in order to respond to a changing environment. They use 
gene regulatory networks, for example, to adjust the optimal expression levels of metabolic 
enzymes in response to changing sugar concentrations. Both the regulatory networks and 
metabolic networks of living cells are often highly stochastic. However, how protein concen- 
tration fluctuations affect the growth rate of a population of cells is largely unknown. We 
present a mathematical model that makes it possible to predict how protein concentration 
fluctuations affect the population's growth rate. The model predicts that when the expres- 
sion level of a protein is close to the value that maximizes the growth rate, fluctuations will 
always reduce the growth rate. However, if the average protein expression level deviates 
sufficiently from the optimal one, then fluctuations can enhance the population's growth 
rate. The reason is that cells that happen to grow faster will dominate the population. We 
also apply our model to investigate the optimal design of a regulatory network. Our analysis 
predicts that this is determined by the trade-off between the cost of synthesizing the proteins 
that constitute the regulatory network, and the benefit of reducing the fluctuations in the 
network that it controls. 
Introduction

Cells continually have to respond and adapt to a changing environment. One important 
strategy to cope with a fluctuating environment is to sense the changes in the environment 
and respond appropriately, for example by switching phenotype or behavior. Arguably the 
most studied and best characterized example is the lac system, where the LacI repressor
measures the concentration of lactose and regulates the expression level of the metabolic 
enzyme that is needed to consume lactose. In this strategy of responsive switching, it 
is critical that cells can accurately sense and respond to the changes in the environment.
However, both the detection and the response are controlled by biochemical networks,
which can be highly stochastic. One might expect that
noise is detrimental, since it can drive cells away from the optimal response curve, i.e. the
optimal enzyme concentration as a function of the lactose concentration. On the other
hand, both reducing noise and creating a regulatory network that allows cells to respond
optimally can be energetically costly [12], which would tend to reduce the fitness of the
organism. In this paper, we present a model that makes it possible to quantify the
effects of biochemical noise on the growth rate of a population of cells that respond via 
the mechanism of responsive switching. We then use this model to perform a cost-benefit 
analysis of gene regulatory control, using cost and benefit functions that have been measured
experimentally [12]. This analysis, which complements recent work by Kalisky and coworkers
[14], predicts that gene regulatory proteins exhibit an optimum expression level, which is
determined by the trade-off between the cost of synthesizing the regulatory protein and the 
benefit of reducing the fluctuations in its target gene. 

It has long been recognized that organisms in a clonal population can exhibit a large 
variation of phenotypes. Within highly inbred lines, for instance, phenotypic variation can 
still be detected [15]. More recently, experiments have vividly demonstrated that gene
expression in uni- and multicellular organisms fluctuates strongly [2].
The fact that fluctuations are not selected out suggests that the optimal fitness requires a
certain amount of biochemical noise. However, how the growth rate of a population depends 
upon biochemical noise is still poorly understood. In a constant environment, stabilizing 
selection favors a genotype that leads to a narrow phenotype distribution centered around
the optimal phenotype in that environment [16]. However, cells do not live in a constant
environment, but rather in one that fluctuates. While one strategy to cope with environmental
fluctuations is to detect and respond to them (responsive switching), an alternative
one is to create diversity in the population. This can be achieved via the mechanism of
stochastic switching [17, 18, 19, 20], whereby members of the population randomly flip between
different phenotypes due to biochemical noise. This strategy is particularly efficient
when the time scales of the environmental fluctuations are either very long, such that the 
investments of constructing an energetically expensive response machinery do not pay off 
[20], or very short, i.e. shorter than the time it takes for the population to respond to
them [18, 19]. Many examples of this strategy exist in nature [22], and this strategy
has recently been studied in much theoretical detail [20]. However, the dominant
strategy for coping with changes in pH, temperature, the food supply or the presence of
various toxic chemicals appears to be responsive switching. In this paper, we will present 
a generic model that makes it possible to quantify the effect of biochemical noise on the 
growth rate of a clonal population of cells that use this mechanism to respond quickly to 
changes in the environment. 

Our model integrates a description of how the internal dynamics of the composition of 
a cell affects the growth rate of that cell with a description of how the growth rates of the 
individual cells collectively determine the growth rate of the population. This allows us to 
address a number of fundamental questions: a) How does the growth rate of the population 
depend upon the growth rate of a single cell as a function of its protein expression levels? b) 
How does the population's growth rate depend upon the variance and the correlation time of 
these fluctuations? Our model predicts that an important parameter that controls the effect 
of biochemical noise is the correlation time of the fluctuations: only when the correlation 
time is long compared to the cell cycle time does biochemical noise affect the growth rate
of the population. Interestingly, recent experiments on E. coli [8] and human cells [11] have
revealed that the correlation times of protein concentration fluctuations can be on the order 
of the cell cycle time, or even longer. Our analysis thus predicts that biochemical noise can 
significantly affect the growth rate of a population of cells. Moreover, our model predicts
that fluctuations can both enhance and reduce the population's growth rate. When the 
average expression level of a protein is close to its optimum, fluctuations in its concentration 
will reduce the population's growth rate. However, when it is sufficiently far from its optimal 
level, fluctuations can actually enhance the growth rate of the population. This effect arises 
at the population level and is a consequence of the fact that cells that happen to grow
faster due to noise become overrepresented in the population.

Our analysis highlights the difference between ensemble averages and time averages [23].
The ensemble or population average of a quantity such as protein noise is defined as the 
average of that quantity over the cells in the population at a given moment in time; when 
a large population exhibits stationary growth, this average does not change with time. The 
time average of a quantity is defined as the average of that quantity in a single cell and its 
descendants over time. The time average is a property of the intracellular biochemical net- 
work: its value only depends upon the dynamics of the protein concentrations. In contrast, 
in experiments often the ensemble average is measured. Our analysis shows
that the ensemble average of a quantity depends not only upon the dynamical properties of
the network, but also on whether fluctuations of this quantity couple to the growth rate of 
the cells. 

The model also allows us to perform a cost-benefit analysis of regulatory control. Re- 
cently, Dekel and Alon performed a series of experiments that strongly suggest that protein 
expression is the result of a cost-benefit optimization problem [12]. They showed that the
expression level of the lac operon is determined by the trade-off between the cost of synthesizing
the metabolic enzyme LacZ and the benefit this enzyme confers in enabling the
consumption of the sugar lactose. In particular, they developed a cost-benefit analysis that 
allowed them to successfully predict the optimal average expression level of the operon as a 
function of the lactose concentration. However, this analysis does not address
how the growth rate depends upon the fluctuations in the expression level of the metabolic
enzyme, nor what determines the optimal average expression
level of the gene regulatory protein that regulates the expression level of the metabolic
enzyme.

While the cost function of synthesizing a gene regulatory protein is probably similar to 
that of producing a metabolic enzyme, their benefit functions are fundamentally different. 
The benefit of producing a metabolic enzyme is that it allows the uptake of the sugar by the 
metabolic network. In contrast, the benefit of synthesizing a regulatory protein is indirect 
and is derived from that of the metabolic enzyme; synthesizing a regulatory protein can be 
beneficial because it allows the cell to adjust the expression level of the metabolic enzyme 
to its optimum in response to a changing sugar concentration. However, a given optimal
expression level of the metabolic enzyme as a function of the sugar concentration does not
uniquely determine the optimal expression level of the regulatory protein. A given optimal
response function of the enzyme expression level as a function of the sugar concentration
can be obtained by different combinations of parameters such as the binding affinity of the
inducer to the regulatory protein, the binding strength of the regulatory protein to the
DNA, the degree to which these molecules bind cooperatively with each other, as well as the
total concentration of the regulatory protein (see Figure 3). What determines the optimal
combination of these parameters that all can yield the same response curve of the enzyme 
expression level as a function of sugar concentration? 

We conjecture that the benefit function of the regulatory protein is determined by the 
fluctuations in the expression level of its target, the metabolic enzyme, although other 
factors such as the response time could play a role as well. As we will show, when the average 
expression level of the metabolic enzyme is close to its optimum, fluctuations will tend to 
reduce the population's growth rate. Different gene regulatory networks can yield the same 
average response function, but can have markedly different noise properties. In particular, 
our analysis predicts that the inducer, e.g. sugar, should bind the gene regulatory protein 
strongly. Moreover, it predicts that higher expression levels of the regulatory protein lower 
the noise in the expression level of the metabolic enzyme. We therefore predict that the 
optimal expression level of a regulatory protein is determined by the interplay between the 
cost of making the regulatory protein and the benefit of reducing the fluctuations in the 
target gene. Recently, a similar idea has independently been proposed by Kalisky, Dekel
and Alon [14]. Using as inputs the cost and benefit functions as measured by Dekel and
Alon [12], our model predicts that the optimal expression level of the lac repressor should
be on the order of 10-50 copies, which is remarkably close to the level found in vivo [24].



Results 

Growth rate 

In order to describe the effects of biochemical noise on the growth rate of a population of 
cells, we have to develop a model that describes how a) the internal dynamics of a cell affects 
the growth rate of that cell and b) how the latter affects the growth rate of the population 
of cells. We now first discuss the latter. 

The growth rates of single cells and the growth rate of the population. In order to quantify the
growth rate of a cell, we have to define a parameter that monitors the progress along the cell
cycle. This parameter, $Z$, could be the amount of replicated DNA, the length of the cell, or
a combination of these parameters. It has a value $Z = Z_i$ at the beginning of the cell cycle
and a value $Z = Z_f$ at the end of the cell cycle. The value of the 'cell cycle coordinate' $Z$
thus exhibits an oscillatory sawtooth pattern as a function of time. Its role is analogous to
that of a reaction coordinate in chemical kinetics, which measures the progress of a chemical
reaction and serves to define the chemical rate constant. In our case, $Z$ serves to quantify
the instantaneous growth rate, $\lambda$, of each cell in the population:

$$\frac{dZ}{dt} = \lambda. \tag{1}$$
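The sawtooth behaviour of the cell cycle coordinate can be illustrated with a minimal Python sketch. It assumes a constant growth rate and follows a single daughter after each division; the function name `simulate_cell_cycle`, the step size and the numerical values are illustrative choices, not part of the model specification.

```python
import math

def simulate_cell_cycle(growth_rate, z_start, z_end, total_time, dt=1e-3):
    """Integrate dZ/dt = growth_rate and reset Z at each division."""
    z = z_start
    divisions = 0
    steps = int(round(total_time / dt))
    for _ in range(steps):
        z += growth_rate * dt            # progress along the cell cycle
        if z >= z_end:                   # end of the cycle: the cell divides
            z -= (z_end - z_start)       # follow one daughter from the start value
            divisions += 1
    return z, divisions

# Illustrative values: a cycle of length log(2) traversed at unit growth rate.
z_final, n_divisions = simulate_cell_cycle(1.0, 0.0, math.log(2), total_time=10.0)
```

With these values each cycle lasts log(2) time units, so the followed lineage divides 14 times in 10 time units.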

The growth rate $\lambda$ depends upon the composition of the cell. This is determined by
the expression level of ribosomal proteins, which are needed to make new proteins, and the
expression levels of metabolic enzymes and other non-ribosomal proteins, which are required
to produce the building blocks for protein synthesis and cell growth [25]. We denote the
concentrations of these different proteins by $\{X_1, X_2, \ldots, X_{n-1}, X_n\} = \mathbf{X}$. The growth rate
$\lambda$ is thus a function of $\mathbf{X}$: $\lambda = \lambda(\mathbf{X})$. Together with the cell cycle coordinate $Z$, $\mathbf{X}$ specifies
the state of each cell in the population.

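To determine the growth rate of a population of cells, a key quantity is the probability
density $P(Z, \mathbf{X}, t)$ to find a cell with a certain state $Z, \mathbf{X}$ inside the population. The
evolution of this probability density can be expressed in operator form as

$$\frac{\partial P(Z,\mathbf{X},t)}{\partial t} = \left[-\frac{\partial}{\partial Z}\lambda(\mathbf{X}) + H_{\mathbf{X}} - g\right] P(Z,\mathbf{X},t). \tag{2}$$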

The first term on the right-hand side describes the evolution of $P(Z, \mathbf{X}, t)$ due to the deterministic
evolution of $Z$ (see Equation 1); it corresponds to a Fokker-Planck operator [26] in
the limit of zero noise. The operator $H_{\mathbf{X}}$ is the Fokker-Planck operator encoding the evolution
of $P(Z, \mathbf{X}, t)$ resulting from the noisy dynamics of the composition $\mathbf{X}$. The last term
describes the effect of cell division on the probability density $P(Z, \mathbf{X}, t)$. Indeed, the cell
division at $Z_f$ amounts to a "dilution" of the probability of finding cells with intermediate $Z$
values. The steady-state probability distribution function, $P_s(Z, \mathbf{X}, t)$, satisfies the equation

$$0 = \left[-\frac{\partial}{\partial Z}\lambda(\mathbf{X}) + H_{\mathbf{X}} - g\right] P_s(Z,\mathbf{X},t), \tag{3}$$

with the boundary condition

$$2 P_s(Z_f, \mathbf{X}, t) = P_s(Z_i, \mathbf{X}, t). \tag{4}$$



This condition formalizes the observation that upon cell division a cell at the end of the cell 
cycle gives birth to two newborns. Importantly, g is the growth rate of the population of 
cells in steady state. In this "stationary state", the number of cells in the population grows
exponentially, but the fraction of cells $P(Z, \mathbf{X})$ with internal states $Z, \mathbf{X}$ has converged to a
time-invariant quantity. At each moment in time, there is a constant fraction of cells ready 
to undergo cell division; the number of cells undergoing cell division thus grows exponentially 
with time, but remains proportional to the population size, with the proportionality factor 
given by the growth rate g. 

The growth rates of single cells and protein concentration fluctuations. The above model is a
generic model of the cell cycle. To make further progress, we have to specify the dynamics
of $\mathbf{X}$. The copy number of a protein will increase as the cell grows, and will (on average)
be divided in half when the cell divides. The copy number will thus exhibit an oscillatory
temporal profile. The volume of the cell will show similar oscillatory dynamics. These
oscillations will tend to cancel each other in their ratio, the concentration of the protein. We
make the simplifying assumption that the concentration of each species fluctuates around a
constant steady-state level during the cell cycle, and that the amplitude of these fluctuations
is small. This allows us to linearize the interactions between the different species at steady
state, and to use the linear-noise approximation; a comparison with a description based
on the chemical master equation has shown that this approximation is surprisingly accurate,
even when the copy numbers are as low as ten. This yields the following set of chemical
Langevin equations:

$$\dot{x}_i = -\sum_{j=0}^{n} f_{ij} x_j + \eta_i, \quad \forall i. \tag{5}$$

Here, $x_i = X_i - X_{s,i}$ is the deviation of the concentration $X_i$ of species $i$ away from its
steady-state value $X_{s,i}$, and $f_{ij}$ corresponds to the coupling between species $i$ and $j$. The
term $\eta_i$ describes the noise in $X_i$ that arises from the stochastic character of the chemical
reactions. We model it as Gaussian white noise, with zero mean and variance determined by
the concentrations of the species at steady state. In Equation 2, the relevant probability
density now becomes $P(Z, \mathbf{x}, t)$ and the operator that describes the evolution of $P(Z, \mathbf{x}, t)$
due to the Langevin dynamics of $\mathbf{x}$ becomes $H_{\mathbf{x}}$ (see Methods).
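As an illustration of the linear Langevin dynamics described above, the following Python sketch integrates a small system with the Euler-Maruyama scheme. It is a sketch under stated assumptions: the noise terms of different species are taken to be uncorrelated, each with strength `2 * noise[i]`, and the coupling matrix `f` and all numerical values are hypothetical.

```python
import math
import random

def simulate_linear_langevin(f, noise, x0, dt, steps, seed=0):
    """Euler-Maruyama integration of dx_i/dt = -sum_j f[i][j] * x_j + eta_i.

    Assumption: <eta_i(t) eta_i(t')> = 2 * noise[i] * delta(t - t'),
    with independent noise sources for different species.
    """
    rng = random.Random(seed)
    n = len(x0)
    x = list(x0)
    for _ in range(steps):
        drift = [-sum(f[i][j] * x[j] for j in range(n)) for i in range(n)]
        x = [
            x[i] + drift[i] * dt
            + math.sqrt(2.0 * noise[i] * dt) * rng.gauss(0.0, 1.0)
            for i in range(n)
        ]
    return x

# Hypothetical two-species example without noise: each deviation decays exponentially.
relaxed = simulate_linear_langevin(
    f=[[1.0, 0.0], [0.0, 2.0]], noise=[0.0, 0.0], x0=[1.0, 1.0], dt=1e-3, steps=1000
)
```

In the noise-free example the deviations relax to approximately exp(-1) and exp(-2) after one time unit, which is a quick check that the sign convention of the coupling matrix matches the equation.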

If the composition of the cells did not fluctuate in time, then the evolution of the cell
cycle parameter $Z$ would be deterministic. The growth rate $\lambda(\mathbf{X})$ of each cell would then
be constant in time, $\lambda(\mathbf{X}) = \lambda_0$, and proportional to the growth rate of the population,
$\lambda_0 \propto g$. In the presence of biochemical noise, the growth rate not only depends upon the
average protein levels, $\mathbf{X}_s$, but also upon the fluctuations around the average, $\mathbf{x}$, which lead
to variations in the growth rate. It is conceivable that the growth machinery responds slowly
to fluctuations in the composition in the cell; the growth rate would then "average" over
fluctuations in the composition over some characteristic time scale $\tau$: $\lambda = \lambda(\mathbf{X}_s, \overline{\mathbf{x}}^{\tau})$, where
the bar with the superscript $\tau$ indicates that the fluctuations in $\mathbf{x}$ are averaged over a time
$\tau$. However, experiments have revealed that protein concentrations fluctuate fairly slowly:
for E. coli, the correlation time is on the order of 45 min, which is on the order of the cell
cycle time [8]. We argue that since the protein concentrations relax slowly, it is reasonable
to assume that the instantaneous growth rate depends upon the instantaneous composition
of the cell. We therefore conjecture that the growth rate is given by $\lambda = \lambda(\mathbf{X}_s, \mathbf{x})$.

To obtain the growth rate $\lambda(\mathbf{X}_s, \mathbf{x})$, we expand it around the steady state $\mathbf{X}_s$ to second
order in $\mathbf{x}$:

$$\lambda(\mathbf{X}_s, \mathbf{x}) = \lambda_0(\mathbf{X}_s) + \sum_i a_i x_i + \sum_{i,j} b_{ij} x_i x_j. \tag{6}$$

The equation for the steady-state probability density $P_s(Z, \mathbf{x}, t)$, Equation 3, can now be
solved by making a multidimensional Gaussian Ansatz for $P_s(Z, \mathbf{x}, t)$:

$$P_s(Z, \mathbf{x}, t) \propto \exp\left[-\log(2)\,\frac{Z - Z_i}{Z_f - Z_i}\right] \exp\left[-\frac{1}{2}(\mathbf{x} - \mathbf{x}^0)^T \mathbf{C}^{-1} (\mathbf{x} - \mathbf{x}^0)\right], \tag{7}$$

where $\mathbf{x}^0$ and $\mathbf{C}$ denote the mean and the covariance matrix of $\mathbf{x}$ in the population.

From now on we shall rescale the time and the $Z$ coordinate such that $Z_f - Z_i = \log(2)$.
In order to understand why such a transformation is useful, it should be noted that in the
absence of protein concentration fluctuations, each cell in the population needs a constant
time between birth and division, $T_{cycle} = (Z_f - Z_i)/\lambda_0$. At the population level, $T_{cycle}$ is
also the time it takes for the population to double in size, such that the growth rate of
the population is $g = \log(2)/T_{cycle}$. Clearly, in the zero fluctuation limit, the growth rate
of the population of cells equals the growth rate of each single cell in the population: $g = \lambda_0$.
In the presence of protein concentration fluctuations, however, the cell cycle times of
the individual cells will fluctuate, such that even a population of cells that are initially
perfectly synchronized will eventually converge towards a steady-state distribution as given
by Equation 7.
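The relation between the cell cycle time and the population growth rate in the zero-fluctuation limit can be checked with a few lines of Python. This is a minimal sketch; the function names and the value 0.5 for the single-cell growth rate are illustrative assumptions.

```python
import math

def cycle_time(lambda0, z_start, z_end):
    """Time between birth and division for a constant single-cell growth rate."""
    return (z_end - z_start) / lambda0

def population_growth_rate(lambda0, z_start, z_end):
    """Growth rate of a population that doubles once per cell cycle."""
    return math.log(2) / cycle_time(lambda0, z_start, z_end)

# With the rescaling Z_f - Z_i = log(2), the population rate equals the single-cell rate.
g_rescaled = population_growth_rate(0.5, 0.0, math.log(2))
```

Without the rescaling the two rates differ by the constant factor log(2)/(Z_f - Z_i), which is why the rescaled coordinate is convenient.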



Time averages do not always equal ensemble averages 

Our model shows that the "time average" of a quantity such as the average protein expression
level or the noise in gene expression is, in general, not equal to its "ensemble average" [23].
The time average of a quantity $X$, $\overline{X}$, is defined as the temporal average of $X$ along one
"line of descent":

$$\overline{X} = \frac{1}{T}\int_0^T X(t)\,dt. \tag{8}$$

Here, $X(t)$ can be obtained by monitoring $X$ as a function of time in a given cell, whereby
upon cell division one follows a randomly chosen descendant. The integration time $T$ should
be much longer than the correlation time of the fluctuations in $X$. To obtain better statistics,
one could average over different trajectories $X(t)$ in a population, but each such path has to
have a different ancestor (the first cell on the path). The ensemble average of the quantity
$X$, $\langle X \rangle$, is defined as the average of $X$ across the population of cells:

$$\langle X \rangle = \frac{1}{N(t)} \sum_{\alpha=1}^{N(t)} X_\alpha(t), \tag{9}$$

where $N(t)$ is the number of cells in the population at time $t$ and $X_\alpha(t)$ is the magnitude
of $X$ in cell $\alpha$ at time $t$; when the growing population is in the stationary state and
$P(Z, \mathbf{x}, t)$ is time invariant, this ensemble average does not change with time. To illustrate
the difference between the two kinds of averages, let us consider the fluctuations in the
composition $\mathbf{X}$. To the extent that protein concentration fluctuations are described by
the chemical Langevin equation (Equation 5), the distribution of the concentrations $\mathbf{X}$ as
obtained by following the time traces of $X_i$ in a given cell and its descendants is given by
a Gaussian that is centered at $\overline{\mathbf{X}} = \mathbf{X}_s$. In contrast, the distribution of $\mathbf{X}$ over different
cells in a population at a given moment in time is also a Gaussian, but now the Gaussian is
centered at $\langle \mathbf{X} \rangle = \mathbf{X}_s + \mathbf{x}^0$, where $\mathbf{x}^0$ may deviate from zero. Moreover, not only the mean,
but also the variance of the two distributions will, in general, differ, as we will show now.
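The two kinds of averages operate on different data: the time average on a trace recorded along one line of descent, the ensemble average on a snapshot of all cells at one moment. The Python sketch below makes this distinction explicit; the function names and the sample numbers are hypothetical, and the time integral is approximated by a simple Riemann sum with a fixed sampling interval.

```python
def time_average(trace, dt):
    """Riemann-sum approximation of (1/T) * integral of X(t) dt along one lineage."""
    total_time = len(trace) * dt
    return sum(value * dt for value in trace) / total_time

def ensemble_average(snapshot):
    """Mean of X over all cells present in the population at one moment."""
    return sum(snapshot) / len(snapshot)

def ensemble_variance(snapshot):
    """Population variance <X^2> - <X>^2 at one moment."""
    mean = ensemble_average(snapshot)
    return sum((value - mean) ** 2 for value in snapshot) / len(snapshot)

# Hypothetical data: one lineage sampled three times, and a snapshot of two cells.
lineage_mean = time_average([1.0, 2.0, 3.0], dt=0.5)
snapshot_mean = ensemble_average([2.0, 4.0])
snapshot_var = ensemble_variance([2.0, 4.0])
```

Nothing in these definitions forces the two means to agree; in a growing population the snapshot is biased towards cells that have grown faster.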



Biochemical noise can both reduce and enhance the population's growth rate 

In order to understand the non-trivial effects of biochemical noise on the growth rate of a 
population of cells, it is instructive to consider a simple example. Let us consider a single
metabolic enzyme $X$, and assume that the temporal dynamics of its concentration is given
by

$$\dot{x} = -\gamma x + \eta, \tag{10}$$

where $x$ is the deviation of the enzyme concentration $X$ away from its steady-state value
$X_s$, $\gamma^{-1}$ is the response time, which is typically on the order of the cell cycle time, and $\eta$ is a
Gaussian white noise term of zero mean and strength $2D$. The time average of the variance
of the fluctuations in the concentration of $X$, as obtained from the time trace of $X$ of a given
cell and its descendants, is $\overline{X^2} - \overline{X}^2 = \sigma_x^2 = D/\gamma$.
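The time-averaged variance D/γ of this process can be checked numerically. The Python sketch below follows a single trajectory using the exact one-step update of the Ornstein-Uhlenbeck process, assuming the noise correlation is 2D δ(t - t'); the parameter values, step size and seed are arbitrary illustrative choices.

```python
import math
import random

def ou_time_variance(gamma, D, total_time, dt=0.05, seed=1):
    """Time-averaged variance of dx/dt = -gamma * x + eta along one trajectory.

    Assumption: <eta(t) eta(t')> = 2 * D * delta(t - t'). The update below is
    the exact transition rule of the process over one step of length dt.
    """
    rng = random.Random(seed)
    decay = math.exp(-gamma * dt)
    step_std = math.sqrt((D / gamma) * (1.0 - decay ** 2))
    x = 0.0
    total = 0.0
    total_sq = 0.0
    steps = int(total_time / dt)
    for _ in range(steps):
        x = x * decay + step_std * rng.gauss(0.0, 1.0)
        total += x
        total_sq += x * x
    mean = total / steps
    return total_sq / steps - mean ** 2

# Illustrative parameters: the estimate should be close to D / gamma = 2.
variance_estimate = ou_time_variance(gamma=1.0, D=2.0, total_time=20000.0)
```

The trajectory must be much longer than the correlation time 1/γ for the estimate to converge, in line with the requirement on the integration time T in the definition of the time average.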

We assume that over the concentration range of interest, the growth rate of a given cell 
as a function of the expression level of X can be written as 

$$\lambda = \lambda_0(X_s) + a x + b x^2, \tag{11}$$

where $\lambda_0(X_s)$ is the growth rate of the cell when the enzyme concentration equals $X_s$. The
growth rate of the population of cells is then given by (see Methods)

$$g = \lambda_0(X_s) + \frac{a^2 D}{\gamma^2 - 4bD} + b\sigma^2. \tag{12}$$

Here, $\sigma^2$ is the variance of the fluctuations in $X$ within the population of cells at a given
time: $\sigma^2 = \langle X^2 \rangle - \langle X \rangle^2$. This ensemble or population average is given by

$$\sigma^2 = \frac{2D}{\gamma + \sqrt{\gamma^2 - 4bD}}. \tag{13}$$

The ensemble average can be written in terms of the time average of the variance, $\sigma_x^2 = D/\gamma$:
$\sigma^2 = 2\sigma_x^2 / \left(1 + \sqrt{1 - 4b\sigma_x^2/\gamma}\right)$. Clearly, if the growth rate is non-linear in $X$, i.e. if $b \neq 0$, the
ensemble average of the variance in $X$ does not equal its time average. Importantly, the
time average of the protein noise, $\sigma_x^2$, is a characteristic of the stochastic properties of
the underlying biochemical network. However, the protein noise is often measured as an
ensemble or population average. Our results show that if one is interested in
the noise properties of the underlying network, one should compute the protein noise by
combining sequential noise traces of cells through lines of descent [30] when the expression
of the fluorescent protein used to measure the noise affects the growth rate significantly
(such that $b$ is much smaller than zero).
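The closed-form expressions for the single-enzyme example are easy to evaluate numerically. The Python sketch below assumes the forms g = λ0 + a²D/(γ² - 4bD) + bσ² and σ² = 2D/(γ + sqrt(γ² - 4bD)), which reduce to the limiting cases quoted in the text (g = λ0 + bσ² for a = 0 and g = λ0 + a²σ_x²/γ for b = 0). The expression used for the population mean of x for general b is derived here from the same Gaussian solution and is not quoted from the text; all parameter values are hypothetical.

```python
import math

def population_statistics(lambda0, a, b, gamma, D):
    """Closed-form population quantities for lambda = lambda0 + a*x + b*x**2."""
    disc = gamma ** 2 - 4.0 * b * D
    if disc <= 0.0:
        raise ValueError("the Gaussian solution requires gamma**2 > 4*b*D")
    root = math.sqrt(disc)
    time_variance = D / gamma                        # variance along one lineage
    population_variance = 2.0 * D / (gamma + root)   # variance across the population
    mean_shift = a * population_variance / root      # population mean of x (derived)
    growth_rate = lambda0 + a ** 2 * D / disc + b * population_variance
    return {
        "time_variance": time_variance,
        "population_variance": population_variance,
        "mean_shift": mean_shift,
        "growth_rate": growth_rate,
    }

# Hypothetical parameter values, chosen only for illustration.
at_optimum = population_statistics(lambda0=1.0, a=0.0, b=-0.5, gamma=1.0, D=0.5)
far_from_optimum = population_statistics(lambda0=1.0, a=0.4, b=0.0, gamma=1.0, D=0.5)
```

In the first case the growth rate falls below λ0 and the population variance is smaller than the lineage variance; in the second case the growth rate exceeds λ0 and the two variances coincide.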

Let us now consider the scenario in which the average expression level of the enzyme is
such that the growth rate is maximal: $X_s = X_{opt}$ (see Figure 1). In this case, $a$ is zero, and
$b = \frac{1}{2}\,d^2\lambda/dX^2 < 0$. The growth rate of the population is then $g = \lambda_0(X_s) + b\sigma^2$. Since $b$ is
negative, $g < \lambda_0$. Hence, when the composition is close to its optimum, biochemical noise
always tends to reduce the overall growth of the population.

If the average expression level $X_s$ deviates significantly from the optimal expression level
$X_{opt}$, the situation is qualitatively different (see Figure 1). Sufficiently far away from the
optimum, the curvature can be ignored ($b = 0$), and the growth rate is given by $g = \lambda_0 + a^2 D/\gamma^2
= \lambda_0 + a^2\sigma_x^2/\gamma$. In this regime noise always increases the growth rate, irrespective
of the sign of $a$, and even though at the single cell level the growth rate $\lambda$ is linear in $X$.
The reason is that cells that happen to have a composition that is closer to the optimum
will grow faster and therefore divide earlier; moreover, the daughter cells will inherit the
composition from their mother, and will thus also grow faster than the steady-state value,
and so on. As a consequence, cells with a higher growth rate become overrepresented in
the population, which can be verified by noting that the mean of $x$ in the population of
cells is now shifted from zero to $x^0 = Da/\gamma^2 = a\sigma_x^2/\gamma$. This mechanism, whereby the cells
that grow faster due to a fluctuation in their protein composition generate more offspring,
increases the overall growth rate of the population. The increase in the growth rate due to
noise, $a^2\sigma_x^2/\gamma$, depends upon how strongly the growth rate changes with $X$, which is given
by the slope $a$, and on the magnitude of the concentration fluctuations in each cell, given by
$\sigma_x$. Importantly, it also depends upon the relaxation time of the fluctuations, given by $\gamma^{-1}$.
If the response time is much shorter than the cell cycle time, then on the relevant time scale
of the cell cycle, the concentrations in all the cells will be the same and no benefit from the
noise can be gained. However, both in prokaryotic [8] and eukaryotic cells [11], correlation
times of protein concentration fluctuations have been measured to be on the order of the
cell cycle time or longer, meaning that they are potentially important. Note also that
a non-zero $x^0$ means that the time average of $X$, which is given by $\overline{X} = X_s$, is not equal to
the ensemble average of $X$, which is given by $\langle X \rangle = X_s + x^0$.
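The overrepresentation of fast-growing lineages in the linear regime can be illustrated with a small Monte Carlo experiment in Python. The sketch rests on an assumption that is not spelled out in the text: the expected number of descendants of a lineage is taken to grow as exp(∫λ dt), so that the noise-induced increase of the population growth rate over a time T is (1/T) log of the lineage average of exp(a∫x dt). For a stationary Ornstein-Uhlenbeck process this quantity is known exactly and approaches a²D/γ² for long times. All names, parameter values and the seed are illustrative.

```python
import math
import random

def lineage_growth_bonus(a, gamma, D, total_time, n_lineages=4000, dt=0.05, seed=2):
    """Monte Carlo estimate of (1/T) * log E[exp(a * integral of x dt)]."""
    rng = random.Random(seed)
    decay = math.exp(-gamma * dt)
    stationary_std = math.sqrt(D / gamma)
    step_std = stationary_std * math.sqrt(1.0 - decay ** 2)
    steps = int(round(total_time / dt))
    weight_sum = 0.0
    for _lineage in range(n_lineages):
        x = stationary_std * rng.gauss(0.0, 1.0)    # start in the stationary state
        integral = 0.0
        for _step in range(steps):
            x_new = x * decay + step_std * rng.gauss(0.0, 1.0)
            integral += 0.5 * (x + x_new) * dt       # trapezoidal rule
            x = x_new
        weight_sum += math.exp(a * integral)         # lineage weighted by its growth
    return math.log(weight_sum / n_lineages) / total_time

def finite_time_prediction(a, gamma, D, total_time):
    """Exact value of the same quantity for a stationary Ornstein-Uhlenbeck process."""
    var_integral = (2.0 * D / gamma ** 2) * (
        total_time - (1.0 - math.exp(-gamma * total_time)) / gamma
    )
    return 0.5 * a ** 2 * var_integral / total_time

bonus = lineage_growth_bonus(a=0.2, gamma=1.0, D=1.0, total_time=10.0)
predicted = finite_time_prediction(a=0.2, gamma=1.0, D=1.0, total_time=10.0)
```

The estimate is positive for either sign of `a`, because lineages that fluctuate towards faster growth carry exponentially more weight than those that fluctuate away from it.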

Lastly, we note here that it is conceivable that the curvature $b$ of the growth rate $\lambda$ is
locally positive. In this case, the solution to Equation 12 is only valid when $\gamma^2 > 4bD$.
At the point where this condition is no longer satisfied, an interesting bifurcation can arise 
towards a state where the growth dynamics alone imposes a bimodal distribution of protein 
concentrations: in the population, cells with a high expression level then co-exist with cells 
with a low expression level. 
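The origin of this validity condition can be sketched with a short calculation. The steps below are a reconstruction under stated assumptions, not the derivation given in Methods: in the rescaled units the dependence on $Z$ is assumed to factor out, so that the marginal distribution $p(x)$ obeys a one-dimensional eigenvalue problem, and a Gaussian with mean $x^0$ and variance $\sigma^2$ is inserted and matched order by order in $x$.

```latex
\begin{align*}
&\text{Marginal equation:} &
0 &= D\,p'' + \gamma\,(x\,p)' + \left(\lambda_0 + a x + b x^2 - g\right) p, \\
&\text{Ansatz:} &
p(x) &\propto \exp\left[-\frac{(x - x^0)^2}{2\sigma^2}\right], \\
&\mathcal{O}(x^2): &
0 &= b\,\sigma^4 - \gamma\,\sigma^2 + D
\;\Rightarrow\;
\sigma^2 = \frac{2D}{\gamma + \sqrt{\gamma^2 - 4bD}}, \\
&\mathcal{O}(x): &
x^0 &= \frac{a\,\sigma^2}{\sqrt{\gamma^2 - 4bD}}, \\
&\mathcal{O}(1): &
g &= \lambda_0 + \frac{a^2 D}{\gamma^2 - 4bD} + b\,\sigma^2 .
\end{align*}
```

The quadratic equation for $\sigma^2$ has a real solution only if $\gamma^2 > 4bD$, which is always satisfied for $b \le 0$ but restricts the noise strength when $b > 0$. For $b = 0$ the expressions reduce to $\sigma^2 = D/\gamma$, $x^0 = Da/\gamma^2$ and $g = \lambda_0 + a^2 D/\gamma^2$.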
Fluctuating environment

The analysis above describes how fluctuations in the composition can affect the growth rate 
of a population of cells in a constant environment. We now briefly discuss how fluctuations in 
the environment affect the population's growth rate. As before, we consider the scenario in 
which cells respond to changes in the environment via the mechanism of responsive switching: 
they thus sense the changes in the environment and respond appropriately. 

If the environmental signals are described by the vector $\mathbf{S}$, then the time varying environment
can, in general, be decomposed as:

$$\mathbf{S} = \mathbf{S}^c + \mathbf{S}^u. \tag{14}$$

Here, $\mathbf{S}^c$ denotes the correlated fluctuations between the different cells, while $\mathbf{S}^u$ corresponds
to the fluctuations in the environmental signals that are uncorrelated from one cell to the
next within the population.

The uncorrelated fluctuations in the external signals can be treated in the same spirit as
the fluctuations in the internal signals. Their dynamics could be added to that of $\mathbf{x}$:

$$\dot{s}^u_i = -\mu_i s^u_i + \xi_i, \quad i = 1 \ldots m, \tag{15}$$

$$\dot{x}_i = -\sum_{j=0}^{n} f_{ij} x_j + \sum_{j=0}^{m} g_{ij} s^u_j + \eta_i, \quad i = 1 \ldots n, \tag{16}$$

where $s^u_i = S^u_i - S^u_{s,i}$, with $S^u_i$ the part of the fluctuations of the external signal $i$ that is
uncorrelated between different cells, and $g_{ij}$ indicates how the internal dynamics of species
$i$ is coupled to the fluctuations in the external signal $j$. Since the fluctuations in $\mathbf{S}^u$ couple
to the fluctuations in the composition $\mathbf{X}$, they could either reduce or enhance the growth
rate of the population, depending on whether the composition $\mathbf{X}$ is close to its optimum or
not, respectively.

The effect of the correlated fluctuations in the external signals, $\mathbf{S}^c$, is much more difficult
to treat analytically [13]. However, if these fluctuations occur on a time scale that is much
longer than the time it takes for the internal dynamics $\mathbf{x}$ to relax towards a new steady state
after an environmental change, the overall growth rate can be written as

$$g = \int d\mathbf{S}^c\, P(\mathbf{S}^c)\, g(\mathbf{S}^c). \tag{17}$$

This expression shows that the cells need to adapt to a given distribution of external signals. 
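For a discrete set of environmental states the average over slowly varying environments becomes a weighted sum. The Python sketch below shows this; the two environments, their probabilities and their growth rates are hypothetical numbers, and the function name is an illustrative choice.

```python
def average_growth_rate(probabilities, growth_rates):
    """Discrete analogue of g = integral of P(S) * g(S) over environments S."""
    if len(probabilities) != len(growth_rates):
        raise ValueError("need one growth rate per environmental state")
    total = sum(probabilities)           # normalize in case the weights do not sum to 1
    weighted = sum(p * g for p, g in zip(probabilities, growth_rates))
    return weighted / total

# Hypothetical example: a poor environment 25% of the time, a rich one 75% of the time.
overall_rate = average_growth_rate([0.25, 0.75], [1.0, 2.0])
```

Each growth rate entering the sum is itself the steady-state population growth rate in that environment, which is why the expression only holds when the environment changes slowly compared to the internal relaxation.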
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We can make an estimate for the time it takes for the population to relax towards a 
new steady after a change in the environment has occurred. If prior to an environmental 
change, the cell cycle coordinate Z has reached steady state, meaning that -P(.Z') is uniform 
across the population of cells, then PiZ) does not have to relax towards a new steady 
state after the change in the environment. The distribution in the composition, -P(X), 
however, does have to relax. If the relaxation time of the population is dominated by the 
slow dynamics of a single protein X, the relaxation rate is given by A; = ^J{i^-^hD). This 
shows that in the absence of fluctuations [D = 0) the relaxation rate is given by the rate 
of protein decay, 7, as one would expect. It also shows that when the growth rate of a 
cell is a concave function of X (6 < 0), fluctuations can actually enhance the relaxation 
rate; the reason is that cells that are closer to the new optimum will grow faster. This 
analysis shows that a conservative estimate for the validity of Equation f|T7|) is that the 
environmental fluctuations should occur on time scales longer than the protein decay time 7. 
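
As a small numerical illustration of the relaxation-rate estimate above, the following sketch evaluates $k = \sqrt{\gamma^2 - 4bD}$ for a few parameter choices. The function name and the parameter values are illustrative assumptions, not taken from the analysis itself.

```python
import math


def relaxation_rate(gamma, b, D):
    """Relaxation rate k = sqrt(gamma^2 - 4 b D) of the population.

    gamma: protein decay rate; b: curvature of the single-cell growth
    rate; D: noise strength. All values passed in are illustrative.
    """
    disc = gamma ** 2 - 4.0 * b * D
    if disc < 0.0:
        # For sufficiently convex growth curves (large positive b) the
        # Gaussian description underlying this estimate breaks down.
        raise ValueError("gamma^2 - 4 b D must be non-negative")
    return math.sqrt(disc)


# Without noise the rate reduces to the protein decay rate.
k_no_noise = relaxation_rate(gamma=1.0, b=-0.5, D=0.0)
# With a concave growth curve (b < 0) noise speeds up the relaxation.
k_with_noise = relaxation_rate(gamma=1.0, b=-0.5, D=0.5)
```

In this sketch `k_with_noise` exceeds `k_no_noise`, in line with the statement that fluctuations enhance the relaxation rate when the growth rate is a concave function of $X$.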

The cost of reducing noise: optimal expression levels of gene regulatory proteins 

In order to understand the design criteria that determine the magnitude of the fluctuations 
in the expression level of a given protein for cells that respond via responsive switching, we do 
not only have to understand how these fluctuations affect the growth rate, as discussed above, 
but also the indirect energetic cost of controlling these fluctuations. Both the magnitude of 
the concentration fluctuations and the cost of controlling these fluctuations are determined 
by the design of the network that regulates the expression level of the protein of interest. We 
will now show, using the lac system as an example, that the optimal design of the regulatory 
network is determined by the interplay between these two factors. 

We use a simple model of the lac system in the absence of glucose but in the presence 
of lactose. The inducer lactose (ligand L) binds the lac repressor (transcription factor TF); 
upon binding, the transcription factor dissociates from the operator and the enzyme, LacZ 
in this case, is expressed. We assume that both the binding of ligand to the transcription 
factor and the binding of the latter to the operator are fast such that they can be integrated 
out. The dynamics of the regulatory protein and the metabolic enzyme is then specified as: 

$$\dot{x} = -\gamma x + \xi_x,$$

$$\dot{e} = -\gamma e + f x + \xi_e. \qquad (18)$$

Here, $x$ denotes the deviation away from the total steady-state TF concentration, denoted
by $X_s$; $e$ denotes the deviation away from the steady-state concentration of the enzyme, $E_s$;
$\gamma$ is the degradation rate of both proteins; and $\xi_x$ and $\xi_e$ model the (Gaussian white) noise
in their expression. The factor $f$ is the differential gain that describes the change in the
protein production rate (expression rate) $k_E(X)$ due to a change in the concentration of the
transcription factor: $f = dk_E(X)/dX$. In this expression we integrate the contributions of
TF-ligand binding, TF-operator binding, and the dynamics of mRNA. The fluctuations in $e$
have an intrinsic source, modeled by $\xi_e$, and an extrinsic one that arises from the fluctuations
in $x$. Since the expression level of the enzyme is much higher than that of the gene regulatory
protein, the dominant source of noise in $e$ is the extrinsic one, arising from fluctuations in
the TF concentration. In what follows, we therefore ignore the intrinsic contribution $\xi_e$.

To make further progress, we need to know how the growth rate of each cell, $\lambda$, depends
upon the expression level of the enzyme and that of the transcription factor. Recently,
Dekel and Alon [12] performed a series of experiments that allowed them to measure both
the cost and the benefit of producing the metabolic enzyme LacZ. By using an artificial 
inducer, they varied the expression level of LacZ in the absence of its substrate lactose, and 
measured the effect on the growth rate. The inducer induces the production of LacZ, but no 
benefit is gained, since the lactose is absent and the inducer is not metabolized. This set of 
experiments thus allowed them to determine the cost of synthesizing the LacZ protein. In 
a separate set of experiments they measured how the growth rate changes with the lactose 
concentration, when the expression level is kept constant (due to a saturating amount of the 
inducer). This set of experiments gave them an (indirect) estimate of the benefit function. 
By assuming that the optimal expression level is given by the level that maximizes the 
benefit minus the cost, the measured cost and benefit functions could be used to predict the 
optimal LacZ expression level as a function of lactose concentration. 

Following Dekel and Alon we write the change in the growth rate of a single cell,
$\Delta\lambda = \lambda - \lambda_0$, due to the production of the gene regulatory protein and the metabolic enzyme
relative to the growth rate in the absence of these proteins, $\lambda_0$, as:

$$\frac{\Delta\lambda}{\lambda_0} = \delta\,(E_s + e) - \eta\,\frac{E_s + e + X_s + x}{1 - (E_s + e + X_s + x)/M}. \qquad (19)$$

The first term on the right-hand side encodes the gain in the growth rate due to the metabolic
activity of the enzyme; importantly, $\delta = \delta(L)$ is a function of the lactose concentration $L$
(see Equation (26) below). The second term, with $\eta$ being a constant, quantifies the cost of
producing the enzyme and the regulatory protein; the factor $M$ is the maximal capacity for
producing non-essential proteins [12]. Note that we assume that the costs of producing one
enzyme molecule and one gene regulatory protein molecule are the same.

As discussed in the introduction, a given average optimal expression curve of $E$ as a
function of sugar concentration, $E_{\rm opt}(L)$, can be obtained by different expression levels of
$X$. A mean-field analysis, which ignores the effect of fluctuations in $E$ and $X$, would
predict that the optimum expression level of $X$ is close to zero, since that minimizes the
cost of producing the regulatory protein. We therefore assume that the steady-state enzyme
expression level, $E_s$, is given by that level $E_{\rm opt}$ that maximizes $\Delta\lambda$ with respect to $E$ at
$X = 0$. The steady-state enzyme expression level is thus given by

$$E_s = E_{\rm opt} = M\left(1 - \sqrt{\eta/\delta}\right). \qquad (20)$$



This expression is, in fact, the principal result of the cost-benefit analysis of the optimal
enzyme expression level of Dekel and Alon [12]. The expression, with $\delta$ being a function of
the lactose concentration (see Equation (26) below), gives a remarkably good prediction for
the enzyme expression level as a function of the lactose concentration [12]. The prediction is
shown in Figure 3C. We now address the question of what the optimal regulatory network is
(the optimal TF concentration $X_s$, the optimal TF-L and TF-operator binding strengths)
under the assumption that the steady-state enzyme expression level as a function of lactose
concentration is fixed and given by Equation (20): $E_s(L) = E_{\rm opt}(L)$.

To obtain the growth rate at $E = E_s + e$ and $X = X_s + x$ (with finite $X_s$), we expand the
growth rate around $E_{\rm opt}$ and $X = 0$, which yields the following expression for the relative
growth rate (see Methods):

On the left-hand side of the above equation, $g$ is the growth rate of the population of cells.
The first two terms on the right-hand side give the deterministic, mean-field prediction that
ignores the effect of fluctuations in $x$ and $e$: in the absence of fluctuations, the growth rate
of the population of cells, $g$, equals the growth rate of each single cell, $\lambda_d$, which is given
by $\lambda_d = \lambda_0 + \lambda_0\left[(\sqrt{\delta} - \sqrt{\eta})^2 M - \delta X_s\right]$ (see Methods). The last term of Equation (21)
describes the effect of fluctuations on the growth rate. The second term on the right hand
side shows that at the mean-field level, there is indeed a pressure to minimize the production
of the regulatory protein $X$; this is associated with minimizing the cost of producing the
regulatory protein. The third term on the right hand side shows, however, that there is also
a pressure to minimize the fluctuations in $X$, given by $\sigma_X^2$. Its origin is that fluctuations
in the gene regulatory protein $X$ lead to fluctuations in $E$, and since the mean expression
level of $E$ is assumed to be at its optimum, these fluctuations tend to lower the growth rate.
Importantly, the magnitude of the fluctuations in $X$ and hence $E$ decreases as the average
expression level of $X$ increases. Clearly, while the cost of producing $X$ tends to lower the
optimal expression level of $X$, the benefit of reducing the fluctuations in $E$ tends to increase
the optimal expression level of $X$. The optimal expression level of $X$ is determined by the
balance between these two opposing factors. A similar conclusion was recently independently
reached by Kalisky et al.



To demonstrate this explicitly, we will study in more detail the last two terms in Equation
(21), which describe the contribution of the transcription factor to the growth rate:

(22) 

In our model, the steady-state enzyme concentration is given by $E_s = E_{\rm opt} = k_E(X_s, L)/\gamma$,
which means that the gain is given by

$$\frac{f}{\gamma} = \frac{dE_s}{dX_s} \sim \frac{E_s}{X_s}. \qquad (23)$$

To make further progress, we have to assume a model for the fluctuations in $X$. If we assume
that these fluctuations are Poissonian, then $\sigma_X^2 = N_X/V^2$ [3], where $V$ is the volume and $N_X$
is the copy number of $X$. Recent results show that while the fluctuations can be stronger
than Poissonian due to, for example, bursts in gene expression, the linear scaling of $\sigma_X^2$
remains correct for many proteins in prokaryotes [31]. Finally, if we assume that $E_s \propto M$,
the expression in Equation (22) is proportional to



f./Vx). (24, 

This expression shows a maximum as a function of $N_X$. The position of this optimum (the
copy number of $X$ that maximizes the growth rate) is related to the copy number of $E$ by

$$N_X \propto \sqrt{N_E}. \qquad (25)$$

We therefore predict that the optimal TF copy number is linear in the square root of the copy
number of the enzyme it regulates. This prediction could perhaps be tested by performing
a statistical analysis of the expression levels of transcription factors and the expression
levels of the target genes these transcription factors regulate. Such a statistical analysis
could be performed in the spirit of that of Ref. [31], in which the authors studied the
variation in the expression levels of 43 Saccharomyces cerevisiae proteins, in cells grown
under 11 experimental conditions. Our analysis would predict that if one were to measure
the expression levels of transcription factors and their target genes in such an experiment,
the two would be correlated according to Equation (25).
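
The square-root scaling can be illustrated with a toy calculation. The sketch below assumes that the TF contribution to the growth rate has the schematic form $-(N_X + c\,N_E/N_X)$: a synthesis cost linear in $N_X$ plus a noise penalty that follows from $f/\gamma \sim E_s/X_s$, Poissonian $\sigma_X^2 \propto N_X$ and $E_s \propto M$. The prefactor `c` and the function names are hypothetical and only serve to show the trade-off; they are not the coefficients of the model.

```python
def tf_contribution(n_x, n_e, c=1.0):
    """Schematic TF contribution to the growth rate (arbitrary units).

    The first term is the cost of making n_x TF copies; the second is
    the penalty from TF noise propagated to the enzyme. The prefactor
    c is a hypothetical constant lumping together all model parameters.
    """
    return -(n_x + c * n_e / n_x)


def optimal_tf_copy_number(n_e, c=1.0, n_max=10_000):
    """Brute-force search for the integer TF copy number that maximizes
    the schematic contribution above."""
    return max(range(1, n_max + 1),
               key=lambda n_x: tf_contribution(n_x, n_e, c))


# Quadrupling the enzyme copy number doubles the optimal TF copy number.
n_opt_small = optimal_tf_copy_number(400)
n_opt_large = optimal_tf_copy_number(1600)
```

Under this assumed form the optimum sits at $N_X = \sqrt{c\,N_E}$, which is the scaling stated in Equation (25).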



Dekel and Alon [12] measured the quantities $\delta$ and $\eta$ used above (Equation (19)) for the
lac system:

$$\eta = 0.02\,E_{\rm wt}^{-1}, \qquad \delta = 0.17\,E_{\rm wt}^{-1}\,\frac{L}{0.4 + L} \qquad (26)$$

(where $L$ is measured in mM units). Here $E_{\rm wt}$ is the fully induced wild-type concentration of
the enzyme, and we use $M = 1.8\,E_{\rm wt}$. As explained in the section Fluctuating Environment
the growth rate in a slowly fluctuating environment can be obtained as an average over 
the different levels of the lactose in the environment. As we do not know the wild type 
distribution of sugar the bacterium experiences, we use either a uniform distribution over 
all possible lactose levels in the interval 0-6mM or a non-uniform bimodal one that peaks at 
small and high lactose concentrations. 
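
The mean-field part of this calculation can be sketched numerically. The code below uses the parameter values quoted above ($\eta = 0.02$, $\delta_{\max} = 0.17$, $M = 1.8$, all in units of $E_{\rm wt}$); the half-saturation constant of 0.4 mM is our reading of Equation (26) and should be treated as an assumption, as should the choice to set the expression level to zero when the benefit cannot outweigh the cost. Only the uniform lactose distribution is shown, and all function names are illustrative.

```python
import math

ETA = 0.02        # cost parameter eta, in units of 1/E_wt
DELTA_MAX = 0.17  # saturating benefit parameter, in units of 1/E_wt
K_L = 0.4         # assumed half-saturation lactose concentration, in mM
M = 1.8           # maximal capacity for non-essential proteins, in E_wt


def delta(lactose_mM):
    """Benefit parameter delta(L) as a saturating function of lactose."""
    return DELTA_MAX * lactose_mM / (K_L + lactose_mM)


def relative_growth(E, lactose_mM):
    """Cost-benefit function at X = 0: delta*E - eta*E / (1 - E/M)."""
    return delta(lactose_mM) * E - ETA * E / (1.0 - E / M)


def e_opt(lactose_mM):
    """Optimal enzyme level M*(1 - sqrt(eta/delta)).

    Assumption: the level is clipped to zero when delta <= eta.
    """
    d = delta(lactose_mM)
    if d <= ETA:
        return 0.0
    return M * (1.0 - math.sqrt(ETA / d))


def mean_field_gain(lactose_mM):
    """Relative growth gain (sqrt(delta) - sqrt(eta))^2 * M at the optimum."""
    d = delta(lactose_mM)
    if d <= ETA:
        return 0.0
    return M * (math.sqrt(d) - math.sqrt(ETA)) ** 2


def average_uniform(fn, lo=0.0, hi=6.0, n=600):
    """Average fn(L) over a uniform lactose distribution (midpoint rule)."""
    h = (hi - lo) / n
    return sum(fn(lo + (k + 0.5) * h) for k in range(n)) * h / (hi - lo)


avg_gain = average_uniform(mean_field_gain)
```

With these numbers the optimal expression level reaches the wild-type level $E_{\rm wt}$ at a lactose concentration of roughly 0.6 mM. The fluctuation term of the full model is not included here; adding it would require the gain $f$ and the TF copy number.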

Figure 2 shows the optimal repressor expression level, for the two different lactose
distributions in the environment. It is seen that the growth rate as a function of the copy number
of the regulatory protein exhibits a broad optimum at around 10-50 molecules. Interestingly,
this is in the biological range [2J]. Even though our model of gene expression is rather
simplified (we use, e.g., a constant amplification factor $f$), it appears that the prediction of our
model is remarkably accurate. Interestingly, Kalisky et al. arrived at a similar prediction,
even though their model differs in a number of ways from ours, as discussed in more detail
in the Discussion section.

Equation (21) shows that the effect of the noise in $X$, $\sigma_X$, on the fluctuations in $E$, and
hence on the growth rate, is determined not only by the decay rate $\gamma$, which controls the
extent to which fluctuations in $X$ and $E$ lead to significant differences between cells in their
composition on the time scale of the cell cycle, but also by the gain $f$, which determines
the extent to which the fluctuations in $X$ are amplified. As we will show now, the optimal
TF-ligand binding curve and TF-operator binding curve are determined by the requirement
that the gain $f$ should be minimized as much as possible. Let us imagine that the binding of
the ligand $L$ to the repressor $X$ is given by

= ^Tl- ^^^^ 

Here, $X$ is the total TF concentration, $X_{\rm free}$ is the concentration of $X$ that is not bound to
the inducer, and $K_d$ is the dissociation constant for ligand-TF binding. The unbound
transcription factor represses the expression of $E$ via the repression function $R = R(X_{\rm free}(X, L))$,
given by

7 7 

We show these relations in Figure 3. It is important to note that the repression function
$R(X_{\rm free}(X, L))$ is not necessarily a simple Hill function; in the lac system this curve is known to
be implemented with a complicated cooperative interaction and binding to multiple operator
sites on DNA. Using Equation (23), $dE_s/dX_s = \frac{\partial E_s}{\partial X_{\rm free}} \frac{\partial X_{\rm free}}{\partial X_s}$, and Equation (27), we arrive
at

7 " "^Z ^ 

To minimize the gain $f$, and hence the effect of noise in $X$ on the growth rate, $K_d$ should
be as small as possible, which corresponds to strong TF-L binding. Since the function
$E_{\rm opt}(L)$ is assumed to be fixed, strong TF-L binding also implies strong TF-operator
binding. Hence, as long as TF-ligand binding and TF-operator binding can be integrated
out, the best strategy would be strong TF-L and TF-operator binding. This is illustrated
in Figure 4, which shows for the lac system the contour plot of the optimal growth rate in
the plane $(X, K_d)$. The conclusion that TF-L and TF-operator binding should be strong
is supported by the experimental observation that the dissociation constant for the binding
of lac repressor to its primary operator site is in the nM range, while the dissociation constant
for the binding of the inducer allolactose to the repressor is on the order of 0.1 $\mu$M [32].
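
The dependence of the gain on the dissociation constant can be made explicit in a short worked example. The derivation below assumes simple non-cooperative ligand binding, $X_{\rm free} = X/(1 + L/K_d)$; this form is an assumption made here for illustration and need not coincide with the binding curve used in the analysis above. The enzyme level is written as a function $\mathcal{E}$ of the free repressor concentration, a symbol introduced only for this sketch.

```latex
% Assumption: non-cooperative binding, X_free = X / (1 + L/K_d),
% and a fixed target curve E_opt(L) = \mathcal{E}(X_free(X, L)).
\begin{align}
\frac{f}{\gamma} = \frac{\partial E_s}{\partial X}
  &= \mathcal{E}'(X_{\rm free})\,\frac{1}{1 + L/K_d}, \\
\frac{dE_{\rm opt}}{dL}
  &= -\,\mathcal{E}'(X_{\rm free})\,\frac{X}{K_d\,(1 + L/K_d)^2}.
\end{align}
% Eliminating \mathcal{E}'(X_free) between the two relations gives
\begin{equation}
\frac{f}{\gamma} = -\,\frac{K_d + L}{X}\,\frac{dE_{\rm opt}}{dL}.
\end{equation}
```

For a fixed target curve $E_{\rm opt}(L)$ and a fixed total repressor concentration $X$, the magnitude of the gain in this sketch grows linearly with $K_d + L$, so it is smallest when $K_d$ is small, that is, when TF-L binding is strong.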



Discussion 

The response machinery allows a living cell to adjust its composition to a changing envi- 
ronment. If the response machinery is fast and operates well, then in each environment the 
cell's composition is optimized such that the growth rate is maximized. Our analysis suggests
that under these conditions, there is an evolutionary pressure to minimize the fluctuations
in the composition. However, the response machinery cannot always optimally adjust the
cell's composition. When there is a drastic change in the environment, for instance, the cell
probably has to change its genotype so as to change its response machinery. Our analysis
suggests that along such an "evolutionary trajectory" from a sub-optimal configuration
of the response machinery to a new optimal one, fluctuations in the composition could be
beneficial, because cells that happen to have a composition that is closer to the new optimum
will grow more rapidly and thereby increase the overall growth rate of the population.
Based on this observation we predict that the periods of fast evolution (for example when a
population colonizes an entirely new environment) are correlated with a positive influence
of fluctuations and thus an increased variability in the population. This idea is supported
by the observation that the regulatory networks that control the response to environmental
changes are in general noisier than the conserved cell machinery [31, 33].

It has been recognized before in a different context that phenotypic variance can be
detrimental under stabilizing selection for the optimal genotype and advantageous far from
this optimal genotype [16]. Moreover, it has been suggested that phenotypic variance
could be maintained if there is an "engineering" cost of minimizing fluctuations [13]. Our
model, however, makes it possible to make a quantitative prediction on the effect of protein
concentration fluctuations on the growth rate of a clonal population of cells. In particular,
the model predicts that the effect of fluctuations in the concentration of a given protein $X$
depends upon the following quantities (see Equation (12)): a) the growth rate of a single cell
as a function of the expression level, $\lambda(X)$ [12]; b) the strength of the fluctuations in $X$;
c) the correlation time of the fluctuations in $X$, given by $\gamma^{-1}$. All these quantities can be
measured experimentally, which would allow for a quantitative test of our model. In this
respect, it would be of particular interest to investigate one of the key ingredients of our
model, which is how the growth rate of a single cell, $\lambda$, depends on the composition $X$. We
have assumed that the growth rate depends upon the instantaneous composition, but it is
conceivable that the growth rate responds to changes in the composition with a time lag;
alternatively, it could depend upon the composition as averaged over some time scale $\tau$:
$\lambda = \lambda(\bar{X}_\tau)$.

Recently, Kalisky, Dekel and Alon reported an analysis of the optimal design of the
gene regulatory network that controls the expression of the lac operon, which complements
ours. While we assume that the correlated fluctuations in the environment are slow, they
also consider correlated fluctuations in the environment that are fast relative to the
response time; on the other hand, their analysis does not address the question of the optimal
dissociation constants for inducer-TF and TF-operator binding. Our analyses also differ in 
the description of the extrinsic contribution to the noise in the expression of the lac operon, 
and in the estimate of the burst size of lac expression. More importantly, Kalisky et al. 
used a simpler model to describe the effect of biochemical noise on the growth rate of a 
population of cells. Our model integrates a description of the effect of noise on the growth 
rate of a single cell with a description of how the growth rates of the single cells collectively 
determine the growth rate of the population. In contrast, their model assumes that the 
growth rate of the population is given by the average of the growth rates of the individual 
cells. This approximation does not allow the model of Kalisky et al. to predict that the 
noise can also enhance the growth rate of the population. This is indeed an effect that arises 
at the population level; it is a consequence of the fact that cells that happen to grow faster 
will take over the population. Moreover, our work illustrates the importance of the
correlation time of the protein concentration fluctuations. However, the present work agrees with
that of Kalisky, Dekel and Alon in that we both find that the optimal concentration of
a gene regulatory protein is determined by the interplay between the cost of synthesizing
the regulatory protein and the benefit of reducing the fluctuations in the expression of its
target gene. Even quantitatively, the predictions of our models for the optimal lac repressor 
concentration are fairly similar, although the model presented here would predict a slightly 
lower optimum concentration and a slightly smaller change in growth rate for deviations 
away from this optimum; this could be due to our conservative estimate of the burst size. 

Our model predicts that if the expression level of the gene regulatory protein is varied 
by a factor 2 from its optimal value, the change in the growth rate would be on the order 
of 10~^. This change is sufficient to provide a selection pressure that is large enough in a 
typical bacterial population with an effective size larger than 10^ cells; indeed, as discussed 
in 34], relative growth rate changes as low as 10~^ are sufficient to balance the genetic drift 
in such a population. A change in the growth rate of 10""^ is thus large enough to provide 
a selection mechanism in a typical bacterial population for driving the transcription factor 
expression level to within a factor 2 from the predicted optimal level. 

Another fundamental question we can address with our model is the relative efficiency,
from the fluctuations point of view, of different modes of regulation (see Methods). For
example, the cost-benefit function of Dekel and Alon implies that the cost grows with a
linear combination of the total enzyme and transcription factor concentration, with positive
coefficients [12]. As a consequence, regulatory networks with anticorrelated fluctuations of
the enzyme and TF concentrations, which correspond to repressor based regulatory networks,
will provide an advantage over those with correlated fluctuations, as for activator based
regulatory networks. This result is consistent with the observation that simple organisms
have more repressors than activators. Unlike alternative explanations for this observation
based on the requirement for genotypic robustness with respect to mutational fluctuations
[36], our explanation does not require that the rate of environmental fluctuations is
comparable to the slow relevant mutation rates.

In this paper, we have focused on the expression of a single protein. Yet, it is clear that
the model presented in Growth rate could be used to study more complicated networks as
well. In these networks, the propagation of noise [37, 38, 39, 40], and hence the effect of noise
on the growth rate, can be intricate, especially when there are (anti-) correlations between
different sources of noise [41]. The model could also be used in conjunction with
partial-differential equation solvers to study non-linear networks, for which biochemical noise is
expected to become even more important.

How could our predictions be tested experimentally? Ideally one would like to perform 
an experiment in which the average expression level of the metabolic enzyme is fixed, while 
the noise in the expression level is varied. Several strategies could be envisioned. First of all, 
one could vary the noise level by tuning the transcription and translation efficiencies
[3, 38]. To make more direct contact with the predictions presented here, however, it would
perhaps be more interesting to vary the expression level of the regulatory protein, while
simultaneously varying the TF-operator binding strength such that the average expression
level of the metabolic enzyme remains constant. Alternatively, one could vary the expression
level of the regulatory protein, while simultaneously changing the concentration of an
artificial inducer such that the enzyme concentration remains constant. For example, it is
possible to increase the binding affinity of the lac repressor to the operator, and therefore
the repression strength, by a factor as high as 10, by either mutating the repressor LacI
[42] or the operator sites [43]. Our analysis predicts that the growth rate as a function of
the expression level of the regulatory protein exhibits a broad maximum as shown in Figure 2.

Methods

The stationary distribution $P_s(Z, x)$

In this section we derive the solution (Equation [Tj) for the stationary probability distribution 
$P_s(Z, x)$. The equation satisfied by $P(Z, x, t)$ for the case of linear Langevin dynamics is:

$$\frac{\partial P}{\partial t} = -\frac{\partial (\lambda P)}{\partial Z} - g(t) P + \sum_{ij} D_{ij} \frac{\partial^2 P}{\partial x_i \partial x_j} + \sum_{ij} \frac{\partial (f_{ij} x_j P)}{\partial x_i}. \qquad (30)$$

The three terms on the right hand side of Equation (30) describe, in order, the drift along
the cell-cycle coordinate $Z$, the normalization of $P$ due to the continuous birth of new
cells in the population, and the Fokker-Planck operator describing the internal dynamics
of the composition of the individual cells [44]. The diffusion strength $D_{ij}$ is given by
$\langle \eta_i \eta_j \rangle = 2D_{ij}$, where $\langle \eta_i \eta_j \rangle$ are the cross-correlations in the Gaussian white noise of $x_i$ and
$x_j$ [28]. The stationary solution satisfies the equation:







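$$0 = -\frac{\partial (\lambda P_s)}{\partial Z} - g P_s + \sum_{ij} D_{ij} \frac{\partial^2 P_s}{\partial x_i \partial x_j} + \sum_{ij} \frac{\partial (f_{ij} x_j P_s)}{\partial x_i}, \qquad (31)$$

with the boundary condition $2P_s(Z_f, x) = P_s(Z_i, x)$.
The instantaneous growth rate is given by:

$$\lambda(x) = \lambda_0 + \sum_i a_i x_i + \sum_{ij} b_{ij} x_i x_j. \qquad (32)$$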



For the stationary distribution we make the Ansatz 



Using the scaling $Z_f - Z_i = \log(2)$ we obtain

$$P_s(Z, x) \sim e^{-(Z - Z_i)} \, e^{-\frac{1}{2} \sum_{ij} \alpha_{ij} (x_i - x_i^0)(x_j - x_j^0)}.$$



(33) 



(34) 



If we insert this Ansatz into Equation (31), we obtain

$$g = \lambda_0 + \sum_i a_i x_i + \sum_{ij} b_{ij} x_i x_j - \sum_{ij} D_{ij} \alpha_{ij} + \sum_i f_{ii} + \sum_{ijkl} D_{ij} \alpha_{ik} (x_k - x_k^0) \alpha_{jl} (x_l - x_l^0) - \sum_{ijk} f_{ij} x_j \alpha_{ik} (x_k - x_k^0). \qquad (35)$$

For this multidimensional polynomial equation to be satisfied for all the values of $x$ we must
have that all the coefficients are zero. Therefore the growth rate is given by:

$$g = \lambda_0 - \sum_{ij} D_{ij} \alpha_{ij} + \sum_i f_{ii} + \sum_{ijkl} D_{ij} (\alpha_{ik} x_k^0)(\alpha_{jl} x_l^0), \qquad (36)$$

where the constants $\alpha_{ij}$ and $x_i^0$ are obtained by solving the set of equations:

$$0 = a_i - 2 \sum_{jkl} D_{jk} \alpha_{ji} \alpha_{kl} x_l^0 + \sum_{jk} f_{ji} \alpha_{jk} x_k^0, \qquad \forall i,$$

$$0 = b_{ij} + \sum_{kl} D_{kl} \alpha_{ki} \alpha_{lj} - \sum_k f_{ki} \alpha_{kj}. \qquad (37)$$

We can read from Equations (37) that negative curvatures of the instantaneous
advancement rate ($b_{ii} < 0$) concentrate the Gaussian stationary distribution $P_s(Z, x)$ (induce
larger $\alpha$'s), while non-zero values for $a_i$ displace the averages $x_i^0$ of the Gaussian stationary
distribution $P_s(Z, x)$ such that $a_i x_i^0 > 0$.
Growth rate controlled by a single enzyme 

We derive here Equation (12). As discussed in the text, we model the dynamics of enzyme
$X$ via the linearized Langevin dynamics,

$$\dot{x} = -\gamma x + \eta, \qquad (38)$$

while we assume that the growth rate of a single cell as a function of the expression level of
$X$ can be written as

$$\lambda = \lambda_0(X_s) + a x + b x^2. \qquad (39)$$

We must solve the equation

$$0 = -\frac{\partial (\lambda P_s)}{\partial Z} - g P_s + D \frac{\partial^2 P_s}{\partial x^2} + \gamma \frac{\partial (x P_s)}{\partial x}, \qquad (40)$$

where we choose $D$ such that the strength of the biochemical noise $\eta$ is $2D$ [4J]. To obtain
the stationary distribution, we make the Ansatz

$$P_s(Z, x) \sim e^{-(Z - Z_i)} \, e^{-\frac{1}{2\sigma^2} (x - x^0)^2}. \qquad (41)$$

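If we insert this into Equation (40), we find that we have to solve the equations

$$g = \lambda_0 - D/\sigma^2 + \gamma + D (x^0)^2/\sigma^4,$$

$$0 = a - 2 D x^0/\sigma^4 + \gamma x^0/\sigma^2,$$

$$0 = b + D/\sigma^4 - \gamma/\sigma^2, \qquad (42)$$

from which we obtain the solution

$$\sigma^2 = \frac{2D}{\gamma + \sqrt{\gamma^2 - 4bD}}, \qquad (43)$$

$$x^0 = \frac{a \, \sigma^2}{\sqrt{\gamma^2 - 4bD}}. \qquad (44)$$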
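
The algebra of the single-enzyme case can be checked numerically. The sketch below assumes the three coefficient conditions obtained from the Gaussian Ansatz (the constant, linear and quadratic terms in $x$) and the root of the quadratic condition that gives a vanishing width as $D \to 0$; the function names and parameter values are illustrative assumptions.

```python
import math


def single_enzyme_solution(lam0, a, b, gamma, D):
    """Width sigma^2, shift x0 and population growth rate g for
    lambda(x) = lam0 + a*x + b*x^2 and dx/dt = -gamma*x + noise (strength 2D).

    Assumes D > 0 and gamma^2 - 4*b*D > 0.
    """
    disc = gamma ** 2 - 4.0 * b * D
    if D <= 0.0 or disc <= 0.0:
        raise ValueError("need D > 0 and gamma^2 - 4 b D > 0")
    k = math.sqrt(disc)
    sigma2 = 2.0 * D / (gamma + k)      # root with sigma2 -> 0 as D -> 0
    x0 = a * sigma2 / k                 # from the condition linear in x
    g = lam0 - D / sigma2 + gamma + D * x0 ** 2 / sigma2 ** 2
    return sigma2, x0, g


def residuals(a, b, gamma, D, sigma2, x0):
    """Left-over of the linear and quadratic coefficient conditions."""
    linear = a - 2.0 * D * x0 / sigma2 ** 2 + gamma * x0 / sigma2
    quadratic = b + D / sigma2 ** 2 - gamma / sigma2
    return linear, quadratic


sigma2, x0, g = single_enzyme_solution(lam0=1.0, a=0.1, b=-0.05, gamma=1.0, D=0.01)
# Small-noise expansion: g ~ lam0 + b*D/gamma + a^2*D/gamma^2.
g_small_noise = 1.0 + (-0.05) * 0.01 / 1.0 + 0.1 ** 2 * 0.01 / 1.0 ** 2
```

For weak noise the exact result agrees with the expansion $g \approx \lambda_0 + bD/\gamma + a^2 D/\gamma^2$: the curvature term lowers the growth rate when $b < 0$, while the slope term always raises it, which is why fluctuations can help when the growth rate is linear in $X$.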
Cost-benefit analysis of gene regulation 

We now present the derivation and the approximations leading to Equation (21). A mean
field analysis of the cost-benefit function of Dekel and Alon [12], Equation (19), predicts
that the maximum growth rate occurs at $E_{\rm opt} = M\left(1 - \sqrt{\eta/\delta}\right)$ and $X = 0$. We are interested
in the growth rate of a cell in which the average enzyme concentration is $E_s = E_{\rm opt}$, while
the average transcription factor concentration, $X_s$, is not zero, but finite. Since the average
transcription factor concentration, $X_s$, is nevertheless small, it is reasonable to assume that
the growth rate of a cell with $E = E_s + e$ and $X = X_s + x$ can be obtained by Taylor expanding
the growth rate given by Equation (19) around the deterministic prediction, $E = E_s = E_{\rm opt}$,
$X = 0$. This yields

$$\lambda = \lambda_d + \lambda_0 \left[ a_1 e + a_2 x + b (x + e)^2 \right], \qquad (45)$$

where

$$a_1 = 0, \qquad a_2 = -\delta, \qquad b = -\frac{\delta}{M} \sqrt{\frac{\delta}{\eta}}. \qquad (46)$$

Here, $\lambda_0$ is the growth rate of each single cell when the gene regulatory protein and the
enzyme are not expressed [45]. The rate $\lambda_d$ is the "deterministic" growth rate, that is, the
growth rate when the regulatory protein and the enzyme are expressed, but fluctuations are
not taken into account. It is given by:

$$\lambda_d = \lambda_0 + \lambda_0 \left[ (\sqrt{\delta} - \sqrt{\eta})^2 M - \delta X_s \right]. \qquad (47)$$

Note that at zero $X_s$ we have:

$$\frac{\lambda_d - \lambda_0}{\lambda_0} = (\sqrt{\delta} - \sqrt{\eta})^2 M. \qquad (48)$$

Equations (36) and (37) can now be solved using Equations (45)-(47) to obtain the growth
rate that takes into account the noise. This leads to the following expression for the growth
rate:



An 



/ 2/ 

.7 7 



4 + (49) 

7 
In deriving Equation (49) we also use the fact that the transcription factor concentration
is much smaller than the typical enzyme concentration, yielding $X_s/E_s \ll 1$. We also use the
inequalities $\delta, \eta < 1/M$ [45] and the Poissonian nature of the noise in the transcription factor:
$\sigma_X^2 = X_s/V$. Equation (49) can be further simplified by keeping in our approximation only
the terms of order one or larger in the small ratio $X_s/E_s$. Please note that in the absence of
fluctuations, the above equation reduces to $g = \lambda_d$: the growth rate of the population of
cells, $g$, then equals the growth rate of each single cell, $\lambda_d$.

The last term in Equation (49) is positive, and, interestingly, promotes fluctuations in
X. It comes from the finite derivative at X = 0, as explained in Biochemical noise can both 
reduce and enhance the population's growth rate. However, 

(50) 



^ M2 X2M2 N^M^' 



Therefore, the last term in Equation (49) is negligible at our level of approximation.
We also have 

2^ Hx- < 2 ^ 2^ 

M^T]^ \Jr]M^ - M2' 

while 

O ^ 2 o^s 1 

2 — \ -(ri < 2—^ . 

M]j r] ^ M^Nx 

We can therefore simplify Equation (49) to the form



(51) 



(52) 



^0 







2/ 

h — 


/ V 


^2 


7 



(53) 



Around the steady state $f/\gamma = dE_s/dX_s$, and we thus also have $|f/\gamma| \sim E_s/X_s \gg 1$, such that
we can simplify Equation (53) further. Nevertheless, it is important to remark that positive
regulation ($f > 0$) increases the detrimental effect of fluctuations in the concentration of
the gene regulatory protein. Hence, at this level the cost of biochemical noise is smaller
for repressors than for activators. Finally, Equation (21) of the main text is obtained by
neglecting the term linear in $f/\gamma$ in Equation (53).

If the response times of the enzyme and the transcription factor are not equal, the same 
analysis gives 



Ar 



An 



M{y/6 - - 6X, 



M(l + 5 




,7i 



- + — 



2/ 

7e 



X) 



(54) 
where $\gamma_x$ is the degradation rate, i.e. the inverse response time, of the transcription factor and
$\gamma_e$ is the degradation rate (inverse response time) of the enzyme. This shows that the effect of
the fluctuations in the transcription factor concentration, $X$, critically depends upon the
response times of $X$ and $E$: only when $X$ fluctuates more slowly than the time scale on
which $E$ can respond to these fluctuations ($\gamma_x < \gamma_e$) are the fluctuations in $X$ propagated
effectively to fluctuations in $E$. In contrast, if the fluctuations in $X$ are fast compared to
the response time of $E$ ($\gamma_x > \gamma_e$), then the slow enzyme dynamics will effectively integrate
out the fluctuations in $X$; indeed, the last term on the right hand side of the above equation
is then small.
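
The filtering of fast TF fluctuations by a slow enzyme can be illustrated with the stationary second moments of the linear model $\dot{x} = -\gamma_x x + \xi_x$, $\dot{e} = -\gamma_e e + f x$, with the intrinsic enzyme noise ignored as in the main text. The closed-form expressions below are the standard result for such a linear cascade, not the growth-rate formula itself, and the function names and parameter values are illustrative assumptions.

```python
def stationary_moments(gamma_x, gamma_e, f, D):
    """Stationary variance of x, covariance of x and e, and variance of e.

    D is the noise strength of the TF (noise correlations equal 2*D).
    """
    var_x = D / gamma_x
    cov_xe = f * var_x / (gamma_x + gamma_e)
    var_e = f * cov_xe / gamma_e
    return var_x, cov_xe, var_e


def integrate_moments(gamma_x, gamma_e, f, D, dt=1e-3, t_end=40.0):
    """Forward-Euler integration of the moment equations as a cross-check."""
    cxx = cxe = cee = 0.0
    for _ in range(int(round(t_end / dt))):
        d_cxx = -2.0 * gamma_x * cxx + 2.0 * D
        d_cxe = -(gamma_x + gamma_e) * cxe + f * cxx
        d_cee = -2.0 * gamma_e * cee + 2.0 * f * cxe
        cxx += dt * d_cxx
        cxe += dt * d_cxe
        cee += dt * d_cee
    return cxx, cxe, cee


# Equal response times: var_e = f^2 * var_x / (2 * gamma^2).
equal_rates = stationary_moments(gamma_x=1.0, gamma_e=1.0, f=5.0, D=0.5)
# Ten times faster TF with the same TF variance (D scaled with gamma_x).
fast_tf = stationary_moments(gamma_x=10.0, gamma_e=1.0, f=5.0, D=5.0)
```

At fixed TF variance and gain, the enzyme variance in this sketch scales as $1/[\gamma_e(\gamma_e + \gamma_x)]$, so making the TF fluctuations faster reduces the noise that reaches the enzyme.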
LIST OF FIGURES

1. A sketch of the instantaneous growth rate $\lambda$ of a single cell as a function of the
concentration $X$ of component X. If the average expression level $X_s$ is close to the optimal
expression level $X_{\rm opt}$, biochemical noise will always decrease the growth rate. If,
however, the average expression level deviates sufficiently from the optimal expression level
(i.e. if $ax > bx^2$ in Equation (11)), then fluctuations can enhance the growth rate of the
population, even when the growth rate $\lambda$ of a single cell is linear in $X$, i.e. if $b = 0$.
The reason is that fast growing cells dominate the population.

2. Relative change in the growth rate as a function of the average repressor concentration.
The growth rate is averaged over different lactose concentrations in the environment
(see Equation (17)), for two different lactose concentration distributions in the
environment.

3. Different regulatory networks can yield the same optimal enzyme expression level as 
a function of inducer concentration. This is illustrated for two regulatory networks of 
the lac system, which differ in the dissociation constants of lactose-repressor binding 
and repressor-operator binding. Panels a) and b) show the response functions at 
two different stages of the lac regulatory network, while panel c) shows the resulting 
optimal enzyme expression level as a function of lactose concentration. a) The fraction
of repressor that is not bound by lactose, $X_{\rm free}/X$, as a function of lactose concentration
for two different lactose-repressor binding constants. b) The corresponding response
curves of the enzyme expression level as a function of the fraction of free repressor.
The total expression level of repressor is chosen to correspond to the optimal growth
rate (see Figure 2). c) The resulting optimal enzyme expression level as a function of
the lactose concentration, as predicted by Equation (20) [12].



4. The optimal design of the lac regulatory network is determined by the lac repressor
copy number and the repressor-lactose binding constant. Contour plot of the growth
rate as a function of the repressor copy number $X$ and repressor-lactose binding
constant $K_d$. The weighting of the lactose levels is nonuniform. Lower binding constants
allow for higher optimal growth rates at lower optimal expression levels for the
repressor.

FIG. 2: Tanase-Nicola and Ten Wolde
FIG. 3: Tanase-Nicola and Ten Wolde
FIG. 4: Tanase-Nicola and Ten Wolde